DOCUMENT RESUME 



ED 351 382                          TM 019 214

AUTHOR          Nandakumar, Ratna
TITLE           Assessing Essential Dimensionality of Real Data.
INSTITUTION     Illinois Univ., Urbana. Dept. of Statistics.
SPONS AGENCY    Office of Naval Research, Arlington, Va.
REPORT NO       1992-2; ONR-4421-548
PUB DATE        5 Aug 92
CONTRACT        N00014-90-J-1940
NOTE            31p.; Paper to be published in "Applied Psychological
                Measurement."
PUB TYPE        Reports - Evaluative/Feasibility (142)
EDRS PRICE      MF01/PC02 Plus Postage.
DESCRIPTORS     Ability; *Computer Simulation; Evaluation Methods;
                *Item Response Theory; Mathematical Models;
                *Psychological Testing; *Test Items
IDENTIFIERS     Ability Estimates; Data Sets; *Dimensionality
                (Tests); *DIMTEST (Computer Program); Stouts
                Procedure



ABSTRACT 

The capability of the DIMTEST statistical test to
assess essential dimensionality of the model underlying item
responses of real tests as opposed to simulated tests was
investigated. A variety of real test data from different sources was
used to assess essential dimensionality. Based on DIMTEST results,
some test data are assessed as fitting an essential unidimensional
model, while others are not. Essential unidimensional test data, as
assessed by DIMTEST, are then combined to form two-dimensional test
data. The power of Stout's statistic T is examined for the
two-dimensional data. It is shown that the results of DIMTEST on real
tests replicate findings from simulated tests in that the statistic T
discriminates well between essential unidimensional and
multidimensional tests and is also highly sensitive to major
abilities while being insensitive to relatively minor abilities
influencing item responses. Five tables present analysis results, and
38 references are included. (Author/SLD)




Reproductions supplied by EDRS are the best that can be made 
from the original document. 




Assessing Essential Dimensionality of Real Data 



Ratna Nandakumar 
Department of Educational Studies 
University of Delaware 



August 5, 1992 



Prepared for the Cognitive Science Research Program, Cognitive and Neural Sciences
Division, Office of Naval Research, under grant number N00014-90-J-1940, 4421-548.
Approved for public release, distribution unlimited. Reproduction in whole or in part is
permitted for any purpose of the United States Government.



REPORT DOCUMENTATION PAGE (Form Approved, OMB No. 0704-0188)

1. AGENCY USE ONLY (Leave blank)
2. REPORT DATE: 5 August 1992
3. REPORT TYPE AND DATES COVERED: Technical; 1990-93
4. TITLE AND SUBTITLE: Assessing Essential Dimensionality of Real Data
5. FUNDING NUMBERS: N00014-90-J-1940
6. AUTHOR(S): Ratna Nandakumar
7. PERFORMING ORGANIZATION NAME(S) AND ADDRESS(ES): Department of Statistics,
   University of Illinois, 725 South Wright Street, Champaign, IL 61820
8. PERFORMING ORGANIZATION REPORT NUMBER: 1992 - No. 2
9. SPONSORING/MONITORING AGENCY NAME(S) AND ADDRESS(ES): Cognitive Sciences Program,
   Office of Naval Research, 800 N. Quincy Street, Arlington, VA 22217-5000
10. SPONSORING/MONITORING AGENCY REPORT NUMBER: 4421-548
11. SUPPLEMENTARY NOTES: To be published in Applied Psychological Measurement. Software
    to carry out procedure available from authors.
12a. DISTRIBUTION/AVAILABILITY STATEMENT: Approved for public release; distribution
     unlimited
12b. DISTRIBUTION CODE
13. ABSTRACT (Maximum 200 words): See reverse
14. SUBJECT TERMS: See reverse
15. NUMBER OF PAGES: 25
16. PRICE CODE
17. SECURITY CLASSIFICATION OF REPORT: unclassified
18. SECURITY CLASSIFICATION OF THIS PAGE: unclassified
19. SECURITY CLASSIFICATION OF ABSTRACT: unclassified
20. LIMITATION OF ABSTRACT: UL

NSN 7540-01-280-5500. Standard Form 298 (Rev. 2-89), prescribed by ANSI Std. Z39-18,
298-102.



ASSESSING ESSENTIAL DIMENSIONALITY-2 



Assessing Essential Dimensionality of Real Data 



Abstract 



The purpose of this article is to validate the capability of DIMTEST to assess 
essential dimensionality of the model underlying the item responses of real tests as opposed 
to simulated tests. A variety of real test data from different sources are used to assess 
essential dimensionality. Based on DIMTEST results, some test data are assessed as fitting 
an essential unidimensional model while others are not. Essential unidimensional test data, 
as assessed by DIMTEST, are then combined to form two-dimensional test data. The
power of Stout's statistic T is examined for these two-dimensional data. It is shown that 
the results of DIMTEST on real tests replicate findings from simulated tests in that the 
statistic T discriminates well between essential unidimensional and multidimensional tests. 
It is also highly sensitive to major abilities while being insensitive to relatively minor 
abilities influencing item responses. 

Subject terms: DIMTEST, essential independence, essential dimensionality, 
unidimensionality, multidimensionality, item response theory.






Most of the currently used item response theory (IRT) models require the assumption 
of unidimensionality. From the strict IRT perspective, unidimensionality refers to one, and 
only one, trait underlying test items. Yet, it is a well-known fact that items are multiply
determined (Humphreys, 1981, 1985, 1986; Hambleton & Swaminathan, 1985, chap. 2; 
Reckase, 1979, 1985; Stout, 1987; Traub, 1983). Hence from the substantive viewpoint, the 
assumption of unidimensionality requires that the test items measure one dominant trait. 
Stout (1987) coined the term essential unidimensionality to refer to a particular 
mathematical formulation of a test having exactly one dominant trait. Dimensionality is, 
however, determined by the joint influence of test items and examinees taking the test 
(Reckase, 1990). In addition, extraneous factors such as teaching methods, anxiety level of 
examinees, etc., may also influence the dimensionality of the given item response data. 
Thus dimensionality has to be assessed each time a test is administered to a new group of 
examinees. 

Factor analysis has traditionally been the most popular approach to assessing
dimensionality (Hambleton & Traub, 1973; Lumsden, 1961). Despite its serious
limitations for analyzing dichotomous data (for example, see Hulin, Drasgow, &
Parsons, 1983, chap. 8), factor analysis has also been the popular method for studying
the robustness of the unidimensionality assumption (Drasgow & Parsons, 1983; Harrison,
1986; Reckase, 1979). A number of other promising methods have been proposed and used
in varying degrees to assess dimensionality. To name a few: full information factor
analysis based on the principle of marginal maximum likelihood (Bock, Gibbons, &
Muraki, 1985; TESTFACT: Wilson, Wood, & Gibbons, 1983); nonlinear factor analysis
(McDonald, 1962; McDonald & Ahlawat, 1974; Jamshid & McDonald, 1983); Holland and
Rosenbaum's (1986) test of unidimensionality, monotonicity, and conditional independence
based on contingency tables; Tucker and Humphreys' methods based on the principle of
local independence and second factor loadings (Roznowski, Tucker, & Humphreys, 1991);
and Stout's (1987) statistical procedure based on essential independence and essential
dimensionality. Hattie (1984, 1985) has provided a comprehensive review of traditional
approaches to assessing dimensionality, and Zwick (1987) has applied some of the
above-mentioned recent procedures to assess the dimensionality of National Assessment of
Educational Progress data. Despite the availability of several procedures, there is no
widespread consensus among substantive researchers on a preferred method, and there is
often dissatisfaction with the assessment of dimensionality (Berger & Knol, 1990;
Hambleton & Rovinelli, 1986; Hattie, 1985).

Stout (1987) proposed a statistical test (DIMTEST) to assess essential
unidimensionality of the latent space underlying a set of items. Nandakumar (1987) and
Nandakumar and Stout (in press) have further modified, refined, and validated DIMTEST
for assessing essential dimensionality on a variety of simulated tests. This article
demonstrates the validity and usefulness of Stout's procedure on a variety of real, as
opposed to simulated, tests. Test data from different sources are collected and used to
assess essential unidimensionality. Essential unidimensional data are then combined to
form two-dimensional data. The power of Stout's statistic T is examined for these
two-dimensional data.

DIMTEST for Assessing Essential Unidimensionality 

DIMTEST, a statistical test for assessing unidimensionality, is based on the theory of
essential dimensionality and essential independence (Stout, 1987, 1990). An item pool is
said to be essentially independent with respect to the latent trait vector $\Theta$ if, for a
given initial segment of the item pool, the average absolute conditional (on $\Theta$)
covariance of item pairs approaches zero as the length of the segment increases. When only
one dominant ability $\theta$ meets the essential independence assumption, the item pool is
said to be essentially unidimensional. In contrast, the assumption of local independence
requires the conditional covariances to be zero for all item pairs in question. The number
of abilities required to satisfy the local independence assumption is the dimensionality of
the test. While the traditional definition of dimensionality (Lord & Novick, 1968) counts
all abilities required to respond to test items correctly to satisfy the assumption of local
independence, essential dimensionality counts only dominant abilities required to satisfy
the assumption of essential independence (as opposed to local independence). DIMTEST,
using this definition, assesses the closeness of approximation of the model generating the
given item responses to the essential unidimensional model. Nandakumar (1991) describes
the theoretical differences between traditional dimensionality and essential dimensionality
and establishes through Monte Carlo studies the usefulness of DIMTEST for assessing
essential unidimensionality in the possible presence of several secondary dimensions.
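
The contrast between the two independence assumptions can be written compactly. The display below is a sketch in notation that is partly assumed here: $U_i$ stands for the 0/1 response to item $i$ and $N$ for the length of the initial segment of the item pool; neither symbol is fixed by the text above. Essential independence asks only that the average absolute conditional covariance vanish in the limit, whereas local independence asks that every conditional covariance be zero.

```latex
% Essential independence with respect to \Theta (limit of an average):
\frac{1}{\binom{N}{2}} \sum_{1 \le i < j \le N}
  \left| \operatorname{Cov}\!\left(U_i, U_j \mid \Theta = \theta\right) \right|
  \longrightarrow 0 \quad \text{as } N \to \infty .

% Local independence (every pair, exactly):
\operatorname{Cov}\!\left(U_i, U_j \mid \Theta = \theta\right) = 0
  \quad \text{for all } 1 \le i < j \le N .
```

Under the first condition, a few item pairs may retain sizable conditional covariances as long as their contribution to the average becomes negligible, which is how minor abilities can be present without raising the essential dimensionality.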

To use DIMTEST for assessing essential unidimensionality, it is assumed that a
group of $J$ examinees take an $N$-item test. Each examinee produces a vector of responses of
1s and 0s, with 1 denoting a correct response and 0 denoting an incorrect response. It is
assumed that essential independence with respect to some dominant ability $\theta$ holds and
that the item response functions are monotonic with respect to the same $\theta$. The
hypothesis is stated as follows:

$H_0: d_E = 1$ versus $H_1: d_E > 1$

where $d_E$ denotes the essential dimensionality of the latent space underlying a set of items.

In order to assess essential unidimensionality of a given set of test data, DIMTEST follows
several steps. The steps are summarized briefly here (for details see Stout, 1987;
Nandakumar & Stout, in press). First, test items are split into three subtests, AT1, AT2,
and PT, with the aid of factor analysis (FA) using part of the sample (a sample size of 500
is recommended for this purpose). Items of AT1 are selected so that they all tap the same
dominant ability. Instead of using FA, it is also possible to use expert opinion (EO) to
select items for AT1. If the FA method of selection is chosen, DIMTEST automatically
determines the length of the subtest AT1. Once items for AT1 are chosen, items of AT2
are selected so that they have a difficulty distribution similar to that of the AT1 items
(for details see Stout, 1987). The remaining items form the partitioning subtest PT.

Second, examinees are assigned to $K$ different subgroups based on their score on the
partitioning subtest PT. In other words, all examinees obtaining the same PT total score
are assigned to the same subgroup. When the subtest PT is "long" and the test is
essentially unidimensional, within each subgroup $k$, examinees are assumed to be
approximately of similar ability. When PT is not long, the subtest AT2 compensates for
the bias in AT1 caused by PT being short. Also, AT2 compensates for the bias in AT1
caused by the presence of guessing or the difficulty factor that is often found by the factor
analysis.
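
The subgrouping step can be sketched in a few lines of Python. This is an illustration only, not the DIMTEST program: the function name `group_by_pt_score`, the list-of-lists data layout, and the tiny response matrix are all assumptions made for the example.

```python
from collections import defaultdict


def group_by_pt_score(responses, pt_items):
    """Assign examinees to subgroups that share the same PT total score.

    responses: list of 0/1 response vectors, one per examinee (assumed layout).
    pt_items:  column indices of the items in the partitioning subtest PT.
    Returns a dict mapping each PT score to the list of examinee row indices.
    """
    groups = defaultdict(list)
    for row, resp in enumerate(responses):
        score = sum(resp[i] for i in pt_items)  # number-correct score on PT
        groups[score].append(row)
    return dict(groups)


# Hypothetical 4-examinee, 4-item data; items 2 and 3 play the role of PT.
responses = [
    [1, 0, 1, 1],
    [0, 0, 1, 1],
    [1, 1, 0, 1],
    [0, 1, 0, 0],
]
subgroups = group_by_pt_score(responses, pt_items=[2, 3])
```

Each key of `subgroups` corresponds to one subgroup, so the number of keys plays the role of the number of subgroups in this toy setting.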

Third, within each subgroup $k$, variance estimates $\hat{\sigma}_k^2$ and
$\hat{\sigma}_{U,k}^2$ and the standard error of estimate $S_k$ are computed using item
responses of AT1. These estimates are then summed across the $K$ subgroups to obtain

$$T_L = \frac{1}{\sqrt{K}} \sum_{k=1}^{K} \frac{\hat{\sigma}_k^2 - \hat{\sigma}_{U,k}^2}{S_k}.$$

Similarly, $T_B$ is computed using items of subtest AT2. Stout's statistic $T$ is given by

$$T = \frac{T_L - T_B}{\sqrt{2}}.$$



The decision rule is to reject $H_0$ if $T > Z_\alpha$, where $Z_\alpha$ is the upper
$100(1-\alpha)$ percentile of the standard normal distribution, $\alpha$ being the desired
level of significance.
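
The computation of the statistic and the decision rule can be sketched as follows. This is a minimal illustration under stated assumptions, not the DIMTEST implementation: it assumes each subtest statistic is the sum over subgroups of (variance estimate minus second variance estimate) divided by its standard error, scaled by the square root of the number of subgroups, as in Stout (1987); the per-subgroup estimates are taken as given inputs because their formulas are not reproduced here; and all function names and numbers are hypothetical.

```python
import math
from statistics import NormalDist


def subtest_statistic(var_hat, var_u_hat, std_err):
    """Normalized sum over subgroups of (var_hat_k - var_u_hat_k) / S_k.

    The three arguments are equal-length lists of per-subgroup estimates,
    assumed to have been computed elsewhere.
    """
    k = len(var_hat)
    total = sum((v - u) / s for v, u, s in zip(var_hat, var_u_hat, std_err))
    return total / math.sqrt(k)


def stout_t(t_at1, t_at2):
    """Combine the AT1-based and AT2-based statistics into T."""
    return (t_at1 - t_at2) / math.sqrt(2)


def reject_h0(t, alpha=0.05):
    """Reject essential unidimensionality when T exceeds the upper-alpha normal point."""
    z_alpha = NormalDist().inv_cdf(1 - alpha)
    return t > z_alpha


# Hypothetical estimates for K = 4 subgroups (not data from the study).
t_at1 = subtest_statistic([0.30, 0.28, 0.35, 0.31], [0.25] * 4, [0.05] * 4)
t_at2 = subtest_statistic([0.26, 0.25, 0.24, 0.25], [0.25] * 4, [0.05] * 4)
t = stout_t(t_at1, t_at2)
decision = reject_h0(t, alpha=0.05)
```

In this made-up example the AT1-based statistic is large and the AT2-based correction is near zero, so the combined statistic exceeds the .05 critical value and the null hypothesis is rejected.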

When the given test data are well modeled by an essential unidimensional model,
items of AT1, AT2, and PT would all be tapping the same dominant dimension. Therefore,
the variance estimates $\hat{\sigma}_k^2$ and $\hat{\sigma}_{U,k}^2$ will be approximately
equal, resulting in a "small" T-value and suggesting the tenability of $H_0$. On the other
hand, when the test data are not well modeled by an essential unidimensional model, the
variance estimate $\hat{\sigma}_k^2$ will be much larger than $\hat{\sigma}_{U,k}^2$,
resulting in a "large" T-value and leading to the rejection of $H_0$.

Simulation studies (Stout, 1987; Nandakumar, 1987; Nandakumar & Stout, in press)
on a wide variety of tests have demonstrated the utility of DIMTEST in discriminating
between one- and two-dimensional tests. Simulation studies by Nandakumar (1991) have
particularly demonstrated the usefulness of DIMTEST in assessing essential
unidimensionality with the aid of a rough index of deviation from essential
unidimensionality. The tests in Nandakumar (1991) were modeled by two- and
higher-dimensional IRT models as opposed to a one-dimensional model, and the test items
were influenced by major and secondary abilities to varying degrees. For some tests, the
secondary ability or abilities influenced a high proportion of items, and for others the
secondary ability or abilities influenced only a small proportion of items. It has been shown
that DIMTEST reliably accepts the hypothesis of essential unidimensionality, provided the
model generating the test is close to the essential unidimensional model. This is the case
when each of the secondary abilities influences relatively few items or, if secondary
abilities influence many items, when the degree of influence on each item is small. The
type-I error in these cases was within tolerance of the nominal level. As the degree of
influence of the secondary abilities increases, however, the approximation to an essential
unidimensional model degenerates, inflating the observed type-I error of the hypothesis of
essential unidimensionality. Simulation results (Stout, 1987; Nandakumar & Stout, in press)
have particularly demonstrated the excellent power of the statistic T when the model
generating the item responses is two-dimensional (two major abilities) with correlation
between abilities as high as .7 and items jointly influenced by both abilities.

Description of Data 

The data sets used in the present study came from different sources. The U.S. history
and literature data for grade 11/age 17, from the 1986 National Assessment of Educational
Progress (NAEP, 1988) test data, were obtained from Educational Testing Service (ETS).
The General Science data, Arithmetic Reasoning data, and Auto Shop Information data for
grades 10 and 12, from the Armed Services Vocational and Aptitude Battery (ASVAB)
test data, were obtained from Linn, Hastings, Hu, and Ryan (1987). The Mathematics
Usage test data, the science test data, and the reading test data were obtained from the
American College Testing program (ACT).

The NAEP achievement tests are part of the so-called Balanced Incomplete Block
(BIB) design with spiraled administration (Rogers et al., 1988), which allows the study of
interrelationships among all items within a subject area. Because the U.S. history and
literature tests fall into the simplest category of BIB design, it was relatively easy to
gather the response data for all examinees taking these tests. Hence, these tests were
chosen for the present study. The items in each area (history and literature) were divided
into four "parallel" blocks with approximately the same number of items. One block of
items out of four was randomly selected in each case for the present study.

The U.S. history test data (HIST-A) with 36 items consists of items requiring
knowledge from different time periods of U.S. history: Colonization to 1763; the
Revolutionary War and the New Republic, 1763-1815; Civil War, 1815-1877; the rise of
modern America, World War I, 1877-1920; the Depression, World War II, 1920-1945;
Post-World War II, 1945 to the present; and map items requiring the knowledge of the
geographical location of different countries in the world. A 31-item subtest of HIST-A,
named HIST, was created (explained in detail in the next section) consisting of all the items
of HIST-A except the five map items. There are 2428 examinees in the HIST-A and HIST
samples.

The literature test data (LIT) with 30 items consists of items requiring knowledge 
within four literary genres: novels, short stories, and plays; myths, epics, and Biblical 
characters and stories; poetry; and nonfiction. There are 2439 examinees in the LIT sample. 

The ASVAB tests are used by the Department of Defense Student Testing Program
in high schools and postsecondary schools. The Arithmetic Reasoning test data for grades
10 and 12, with 30 items each, consists of items requiring knowledge in solving arithmetic
word problems. The arithmetic reasoning test sample for grade 10 (AR10) has 1984
examinees, and for grade 12 (AR12) has 1961 examinees. The Auto and Shop Information
test data for grades 10 and 12, with 25 items each, consists of items requiring knowledge of
automobile, tools, and shop terminology and practices. The auto shop test sample for grade
10 (AS10) has 1981 examinees, and for grade 12 (AS12) has 1974 examinees. The General
Science test data for grades 10 and 12, with 25 items each, consists of items requiring
knowledge of high school level physical, life, and earth sciences. There are 1990
examinees in the general science test sample for grade 10 (GS10) and 1990 examinees in the
general science grade 12 (GS12) sample.

The ACT mathematics usage test data (MATH) with 40 items consists of items 
requiring knowledge in solving different types of mathematics problems: arithmetic and 
algebra operations, geometry, numeration, story problems, and advanced topics. There are 
2491 examinees in the MATH sample. 

The ACT reading test data (READ-A) with 40 items consists of 4 passages, each
followed by 10 questions. The first three passages are taken from different books all dealing
with humanities, and the last passage is taken from a book about psychology. The first
passage came from Of the Farm by John Updike. The second passage came from Light and
Color in Nature and Art by Samuel Williamson and Herman Cummins. The third passage
came from Theatre: the Dynamics of the Art by Brian Hansen. And the fourth passage
came from Toward a Psychology of Being by Abraham Maslow. A 30-item subset of
READ-A named READ was created (details in the next section) consisting of the first 30
items of READ-A. There are 5000 examinees in the READ-A and READ samples.

The ACT science test data (SCI-A) with 40 items consists of 7 passages, each
followed by 5 to 7 questions. The first passage dealt with the effect of the thymus gland on
the development of the immune system in mice. The second passage dealt with sub-surface
ground water movement and its effects for waste disposal. The third passage dealt with the
periods of the pendulum on the earth and the moon and its relationship to the string length
and mass of the ball. The fourth passage dealt with the environmental impact of effluent.
The fifth passage dealt with a bimetallic catalyst and its relationship to the speed of
certain chemical reactions. The sixth passage dealt with the views of two paleontologists on
the characteristics of dinosaurs. And the seventh passage dealt with the principles of
osmosis and osmotic characteristics of 3 categories of organisms. A 28-item subset of
SCI-A named SCI was created (explained in the next section) consisting of the first 28
items of SCI-A. There are 5000 examinees in the SCI-A and SCI samples.

In addition, in order to examine the effect of sample size on DIMTEST, both SCI and
READ are randomly split into four mutually exclusive data sets. READ is split into
READ1, READ2, READ3, and READ4, with 750, 1000, 1250, and 2000 examinees,
respectively. Similarly, SCI is split into SCI1, SCI2, SCI3, and SCI4, with 750, 1000, 1250,
and 2000 examinees, respectively. In all, there are 22 sets of test data. These are listed along
with the test size and sample size in the first three columns of Tables 1 and 2.
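
A random split of 5000 examinees into four mutually exclusive samples of the stated sizes can be sketched as below. The function name `split_sample`, the fixed seed, and the use of row indices in place of actual examinee records are assumptions for illustration; the text does not say how the original split was carried out.

```python
import random


def split_sample(n_examinees, sizes, seed=0):
    """Randomly partition examinee indices into disjoint groups of the given sizes."""
    if sum(sizes) > n_examinees:
        raise ValueError("requested group sizes exceed the number of examinees")
    rng = random.Random(seed)          # seeded so the split is reproducible
    order = list(range(n_examinees))
    rng.shuffle(order)                 # random order of examinee indices
    parts, start = [], 0
    for size in sizes:
        parts.append(order[start:start + size])  # consecutive, non-overlapping slices
        start += size
    return parts


# Four mutually exclusive samples with the sizes given in the text.
parts = split_sample(5000, [750, 1000, 1250, 2000])
```

Because the slices are taken from a single shuffled list, no examinee can appear in more than one sample, and here the four samples together use all 5000 examinees.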






Creation of Two-Dimensional Test Data

Three different sets of two-dimensional test data from the content perspective were
created by combining responses from test data that were assessed as essentially
unidimensional by DIMTEST in the present study.

The two-dimensional test data, RS, was created by combining responses of 30 items
of READ with the responses of 6 items of SCI, forming a 36-item test with 5000 examinees.
The 6 items of SCI are part of one of the passages randomly selected from its 5 passages.
Just as in the unidimensional case of READ and SCI, RS is then randomly split into 4
mutually exclusive data sets, RS1, RS2, RS3, and RS4, with 750, 1000, 1250, and 2000
examinees, respectively. These tests are listed along with their test sizes and sample sizes
in the first four columns of Table 3.

The two-dimensional test data ARGS1, for grade 10, was created by combining the
responses of 30 items from AR10 with the responses of 5 items (randomly selected from 25
item responses) from GS10. Similarly, ARGS2 was created by combining the responses of
30 items from AR10 with the responses of 10 items from GS10. The two-dimensional test
data GSAR1, for grade 12, was created by combining the responses of 25 items from GS12
with the responses of 5 items from AR12; and GSAR2 was created by combining the
responses of 25 items from GS12 with responses of 10 items from AR12. These test data are
listed along with their test sizes and sample sizes in the first four columns of Table 4.

The two-dimensional test data HSTLIT1 was created by combining the responses of
31 items from HIST with the responses of 5 items (randomly selected from 30 item
responses) from LIT. Similarly, HSTLIT2 and HSTLIT3 were created by combining the
responses of 31 items from HIST with the responses of 8 and 10 items, randomly selected,
from LIT, respectively. These test data are listed along with their test sizes and sample
sizes in the first four columns of Table 5.
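
The construction of a two-dimensional data set from two unidimensional ones can be sketched as follows. This is an illustration under a strong assumption: the rows of the two response matrices are taken to be already aligned, one row per examinee, and the text does not describe how examinees from the two tests were matched. The function name `combine_tests` and the small matrices are hypothetical.

```python
import random


def combine_tests(base, added, added_items):
    """Append the selected item columns of `added` to each row of `base`.

    base, added: lists of 0/1 response vectors with one row per examinee,
                 assumed to be aligned row by row (an assumption of this sketch).
    added_items: column indices of the items taken from `added`.
    """
    if len(base) != len(added):
        raise ValueError("both tests must have the same number of examinee rows")
    return [b + [a[i] for i in added_items] for b, a in zip(base, added)]


# Hypothetical data: a 3-item base test and a 4-item second test, 3 examinees.
base = [[1, 0, 1], [0, 0, 1], [1, 1, 1]]
added = [[1, 1, 0, 0], [0, 1, 0, 1], [1, 0, 0, 1]]

rng = random.Random(0)
picked = sorted(rng.sample(range(4), 2))   # randomly select 2 of the 4 added items
combined = combine_tests(base, added, picked)
```

With 31 base items and 5, 8, or 10 randomly selected added items, the same pattern would give tests of 36, 39, or 41 items.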






Results 

Unidimensional Studies

All the tests in Table 1, except HIST, READ, and SCI (which are derived subtests of
HIST-A, READ-A, and SCI-A, respectively, as described below), were initially tested for
essential unidimensionality using DIMTEST. In each case, 500 examinees were randomly
selected from the given pool for the purpose of selecting AT1 items using factor analysis.
The rest of the examinees were used for computing Stout's statistic T. The size of AT1 (M)
was also determined by DIMTEST. For each test, the T-value and the p-value are noted.
Table 1 lists the T- and p-values for all tests in the fourth and fifth columns. The method of
selection of the AT1 subtest, the value of M, and item numbers selected for AT1 are listed
in the last three columns of Table 1.



Table 1 about here 



It can be seen from Table 1 that the p-values associated with test data LIT, AR10,
AR12, GS10, and GS12 are well above the nominal level of significance ($\alpha = .05$), thereby
strongly affirming the essential unidimensional nature of these tests. That is, the underlying
model generating the test data is judged essentially unidimensional. However, the p-values
associated with HIST-A, AS10, AS12, MATH, READ-A, and SCI-A are well below the
nominal level of significance of .05, thereby strongly affirming the multidimensional nature
of these test data. For these tests where p-values were below the nominal level, the nature
of multidimensionality was further explored.
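
The comparison of p-values with the nominal level can be sketched as below. The sketch assumes the p-value is the upper-tail standard normal probability of the observed T, which is consistent with a one-sided rule that rejects for large T; the function name `p_value` is hypothetical.

```python
from statistics import NormalDist


def p_value(t):
    """Upper-tail standard normal probability of an observed T-value."""
    return 1.0 - NormalDist().cdf(t)


def is_rejected(t, alpha=0.05):
    """True when the p-value falls below the nominal level of significance."""
    return p_value(t) < alpha


# Illustrative T-values only; these are not values from Table 1.
small_t_p = p_value(0.0)
large_t_p = p_value(3.0)
```

A T-value near zero gives a p-value near .5, well above .05, while a T-value of 3 gives a p-value far below .05.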




When the test data are essentially unidimensional, items of AT1 are, by logic, of the
same dominant dimension as the rest of the items; therefore, DIMTEST does not reject the
null hypothesis. When the test data are not unidimensional, however, the items of AT1 are
dimensionally different from the rest of the items, and DIMTEST rejects the null
hypothesis of essential unidimensionality. Following this reasoning, for tests where p-values
were very low, the content of the AT1 items was examined. Table 1 shows that for
HIST-A, items 12 through 16 and item 6 were selected for AT1. Upon studying the content
of these items, it was found that items 12 through 16 were homogeneous and differed
dimensionally from the rest of the items of HIST-A; these 5 items require the knowledge of
location of different countries on the world map (map items), while the rest of the items
deal with U.S. history. It is also possible in theory that these items were selected for AT1
due to chance alone. In order to test for this, DIMTEST was applied to the given sample of
2428 examinees 100 times, each time randomly splitting the 2428 examinees into
two groups of 500 and 1928 examinees. That is, AT1 items were selected repeatedly on
different random samples of 500 examinees each. The resampling results showed that items
12 through 16 were consistently selected for AT1. In addition to these items, one or two
more items, which varied from run to run, were selected from the rest of the items. Hence
it was concluded that the map items are dimensionally different from the rest. A subset
HIST was formed consisting of all items of HIST-A except for the map items. It can be seen
from Table 1 that the p-value associated with HIST (p=.095) shows evidence of essential
unidimensionality. Furthermore, from the content perspective, items of AT1 do not form a
set that is dimensionally different from the rest of the items of HIST.
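
The resampling check can be sketched as a loop that repeatedly draws 500 of the 2428 examinees, runs an item-selection routine on that subsample, and counts how often each item is chosen. The sketch is an assumption-laden illustration: the factor-analytic selection step is not reproduced, so a caller-supplied function stands in for it, and `toy_selector` below is a purely hypothetical stand-in whose output is made up to exercise the loop, not a result from the study.

```python
import random
from collections import Counter


def resample_at1_selection(n_examinees, select_at1, n_select=500, n_runs=100, seed=0):
    """Count how often each item is selected across repeated random subsamples.

    select_at1: function taking a list of examinee row indices and returning
                the item numbers chosen for the first assessment subtest
                (a placeholder for the real selection procedure).
    """
    rng = random.Random(seed)
    counts = Counter()
    for _ in range(n_runs):
        rows = rng.sample(range(n_examinees), n_select)  # selection subsample
        counts.update(set(select_at1(rows)))             # one count per item per run
    return counts


def toy_selector(rows):
    """Hypothetical stand-in: a fixed block of items plus one sample-dependent item."""
    extra = 1 + (sum(rows) % 11)        # some item numbered 1 through 11
    return [12, 13, 14, 15, 16, extra]


counts = resample_at1_selection(2428, toy_selector)
always_selected = sorted(item for item, c in counts.items() if c == 100)
```

Items that are selected in every run are candidates for a dimensionally distinct cluster, while items whose counts vary from run to run are more plausibly chance selections.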

A similar phenomenon was observed with test data READ-A and SCI-A. For
READ-A, the last 10 items (items following the last passage) formed part of subtest
AT1. Again, these same 10 items formed part of AT1 in repeated resampling applications of
DIMTEST. Upon studying the content of these items, it was found that these 10 items
tapped the "psychology" content area, which is different from the "literature" area tapped by
the first three passages. Another possibility is that, since these are the last 10 items of the
reading test, speededness could have caused the secondary dimension. Based on these
observations, it was concluded that these items were dimensionally different from the rest,
and a subset READ was formed consisting of the first 30 items of READ-A. It can be seen
from Table 1 that the p-value associated with READ (p=.32) shows strong evidence of an
essential unidimensional model underlying the test items. In addition, items of AT1 now
come from all the passages of READ.

For test data SCI-A, the 12 items following the last two passages formed part of
AT1. Just as in HIST-A and READ-A, after resampling application of DIMTEST, these
items were removed. The resulting subtest SCI with the first 28 items was still found to be 
multidimensional (p=.002). Thus, a unidimensional subset could not be formed. Unlike 
reading test items, science test items come from distinctly different content areas, with a 
moderate correlation among content areas, and require a higher level of abstract reasoning 
and analytical skills than the reading items. Thus, in addition to content areas, difficulty 
or speededness could have caused major secondary dimensions in this case. 

For the test data MATH, AS10, and AS12, where p-values were low, items of AT1
did not form a subgroup tapping a secondary ability as found in HIST-A, READ-A, or
SCI-A. In addition, upon studying the content of the items, it was found that these items
tap multiple major content areas. Therefore these test data are treated as multidimensional.



Table 2 about here 



Table 2 shows dimensionality results of the unidimensional READ and
multidimensional SCI test data for different sample sizes. The p-values associated with
READ1 through READ4 show evidence of a high degree of essential unidimensionality
underlying the test data. These results are consistent with those of READ in Table 1. The
items selected for AT1 for tests READ1 through READ4 are highly varied, and yet they
consistently affirm essential unidimensionality. The results of SCI1 through SCI4 are
consistent with those of SCI in Table 1 in affirming multidimensionality of the test data.
Items of AT1 varied highly for all four tests and yet consistently affirmed
multidimensionality, except for SCI3.

Two-dimensional Studies 

Results of two-dimensional reading and science test data are reported in Table 3.
Since items that tap a distinct second dimension, from the content perspective, are clearly
known (in this case, 6 SCI items), the science items were forced to be selected for AT1.
This is an example where expert opinion is used to select AT1 items. The T- and p-values
for RS1, RS2, RS3, RS4, and RS strongly confirm the two-dimensional nature of these test
data. As expected, as the sample size increases, the power also increases.



Table 3, Table 4 and Table 5 about here 



The results of the two-dimensional test data of ARGS and GSAR are reported in
Table 4. In this case also, since the items that are used to create these two-dimensional data
are known (GS items for ARGS and AR items for GSAR), these items were forced to be
selected for AT1. The T- and p-values associated with all four tests strongly confirm
the multidimensionality of these test data. For ARGS1 and ARGS2, there is a sharp
increase in the T-value, and a corresponding decrease in the p-value, as the degree of
contamination, as measured by the number of item responses contaminated, increases
from 5 to 10.

The results of the two-dimensional history and literature test data are reported in 
Table 5. As with other two-dimensional tests, LIT items were forced to be selected for 
AT1. Also in this case, the T- and p-values confirm the multidimensional nature of these 
data. 

DIMTEST was again applied to a sample of test data selected from two-dimensional 
tests. This time FA was used as the method of selection for AT1 items. The purpose of this 
analysis was to check whether the FA method of selecting AT1 items would lead to p-values 
similar to those obtained with EO. The findings revealed that for these tests FA could not 
always ferret out purely unidimensional items from the content perspective. The subtest AT1 
had a mixture of items tapping both dimensions, and DIMTEST was then able to correctly 
assess dimensionality only when there were 1000 or more examinees for computing the statistic. 

Discussion and Conclusions 

None of the tests examined in the present study are strictly unidimensional in the 
sense of measuring only one ability. Items, in every test, are influenced by several 
secondary abilities in addition to the major ability intended to be measured. Based on 
DIMTEST analysis, some test data were assessed as fitting an essential unidimensional 
model while others were not. This depends upon whether the secondary abilities were major 
or minor. 

The unidimensionality analysis of HIST-A, READ-A, and SCI-A presents interesting 
findings. For HIST-A, the map items had high second factor loadings and thus were 
selected for AT1. Consequently, the computed T-statistic was large, leading to the 
rejection of H0 and implying that AT1 items are dimensionally different from the rest of 
the test. Content analysis of HIST-A reveals that HIST-A consists of items of United 
States history for different time periods spanning from 1763 to present time. These items 
cover such a large span of time that the test is surely slightly multidimensional for this 
reason alone. In addition, the test contains map items. The map items, however, were 
isolated and statistically confirmed as not measuring the same trait as the rest of the test. 
This shows that the statistic T is highly sensitive to distinct major dimensions (in this 
case, map items). The analysis of HIST, with map items removed, reveals that it is 
essentially unidimensional. Thus the statistic T seems to be robust against relatively minor 
correlated abilities influencing test items while being sensitive to major abilities. Likewise, 
for the test data READ-A, multidimensionality was caused by items tapping a psychology 
topic (scientific) versus literature topics (humanities). Once the psychology item responses 
were removed, the remaining item responses could be well modeled by an essential 
unidimensional model. In contrast, the multidimensionality in SCI-A was due not only to 
distinct major abilities but also likely to speededness of the test, which in itself is a 
major determinant. Moreover, an essential unidimensional subtest could not be formed for 
SCI-A. 

Another interesting feature of these analyses is that although both READ and SCI 
are paragraph comprehension type test data, they differ widely in the degree of their 
approximation to essential unidimensionality. The READ test data has 3 passages each 
followed by 10 items, all dealing with humanities. Although these passages come from 
different sources, the model underlying the item responses approximates an essential 
unidimensional model. This is an example where a few secondary abilities (possibly highly 
correlated) each influence a large group of items. In contrast, the SCI test data has 5 
passages each followed by 5 or 6 items. These passages, although they deal with science in 
general, come from widely different and conceptually difficult topics, and the model 
underlying the item responses does not approximate an essential unidimensional model. 
This is an example where many secondary abilities each influence a small group of items, 
but the strength of the influence of these secondary abilities is such that item responses 
cannot be well modeled by an essential unidimensional model. These results are consistent 
with simulation results of Nandakumar (1991) in that the number of items influenced by 
secondary abilities and the strength of the secondary abilities present determine the degree 
to which the assumption of essential unidimensionality is violated. 

The results obtained in this study are similar to the results obtained by other 
researchers who have analyzed some of these data using different statistical methodologies. 
Zwick (1987) performed dimensionality analyses of HIST-A and LIT by various techniques 
to assess dimensionality and concluded that these are unidimensional. Regarding the ACT 
data, it is believed that MATH and SCI are multidimensional. Bock, Gibbons, and Muraki 
(1985) have analyzed ASVAB test data for a different sample and found a significant 
second factor for arithmetic reasoning, general science, and auto shop information. Since 
the sample used here is not the same, it is hard to develop a meaningful comparison. 

The results of two-dimensional tests demonstrate very good power of the statistic 
T. The statistic T has the capability to distinguish minor secondary traits, which should be 
largely discounted, from the major dominant traits. This is evidenced in several cases. The 
test data HIST illustrates this. There is inherent multidimensionality in HIST as it covers 
a range of time periods in history. However, the p-value is above the nominal level of 
significance, suggesting acceptance of unidimensionality. By contrast, with the additional 
contamination of only 5 LIT items or 5 map items, the T-value shoots up, indicating 
essential multidimensionality of the data. This remarkable sensitivity of the statistic T to 
major dimensions illustrates its power. 

These results, for the first time, have illustrated both the factor analysis approach 
and the expert opinion approach to selecting items for the subtest AT1. Tables 1 and 2 use FA 
to select AT1 items, and Tables 3, 4, and 5 use EO. It is evident that FA serves as an 
exploratory tool and EO serves as a confirmatory tool in selecting items for AT1 to assess 
essential dimensionality. 

The dimensionality of a given set of item responses is, in a certain sense, a 
continuum: one cannot determine whether a given set of responses generated by a set of 
items administered to an examinee sample is truly essentially unidimensional or truly 
multidimensional; one can only approximate. Although the exact number of dimensions in an 
IRT model is rigorously defined for a finite length test, the number of dominant dimensions, 
whether determined by Stout's essential dimensionality conceptualization or by some other 
conceptualization, is only rigorously definable for an infinitely long test. In other words, 
for a finite test (that is, for any real test data) it is a judgment call whether a particular 
IRT model is seen as having one, or more than one, dominant dimension, based upon where 
on the continuum the amount of multidimensionality falls. One consequence of this is that 
the performance of ability estimation procedures such as LOGIST or BILOG needs to be 
addressed in the context of the assessment of the amount of lack of unidimensionality. In 
this regard, indices of lack of essential unidimensionality developed by Junker and Stout 
(1991) will be extremely useful. These indices can be used to decide when it is safe to use 
unidimensional estimation procedures such as LOGIST and BILOG to arrive at accurate 
estimates of ability. 

In cases where the approximation of an essential unidimensional model to the data is in 
question, there are various alternatives. The test items can be split into essential 
unidimensional subtests (for example, HIST-A and READ-A). Another possible approach 
is to investigate the applicability of the concept of "testlet" to the data (Rosenbaum, 1988; 
Thissen, Steinberg, and Mooney, 1989). If the assumption of local independence is violated 
within the passages but maintained among the passages, the theory of testlets promises 
unidimensional scoring for such tests. The test data SCI-A and SCI could fall into this 
category. Multidimensional modeling can be applied if neither of the above procedures can 
be applied (Reckase, 1989). 






References 

Berger, M. P., & Knol, D. L. (1990). On the assessment of dimensionality in 
multidimensional item response theory models. Paper presented at the annual AERA 
meeting, Boston. 

Bock, R. D., Gibbons, R., & Muraki, E. (1985). Full-information item factor analysis 
(MRC Report No. 85-1). Chicago: National Opinion Research Center. 

Drasgow, F., & Parsons, C. (1983). Applications of unidimensional item response 
theory models to multidimensional data. Applied Psychological Measurement, 7, 189-199. 

Hambleton, R. K., & Swaminathan, H. (1985). Item response theory: Principles and 
applications (p. 19). Kluwer-Nijhoff Publishers, Boston. 

Hambleton, R. K., & Traub, R. E. (1973). Analysis of empirical data using two logistic 
latent trait models. British Journal of Mathematical and Statistical Psychology, 24, 
273-281. 

Hambleton, R. K., & Rovinelli, R. J. (1986). Assessing the dimensionality of a set of test 
items. Applied Psychological Measurement, 10, 287-302. 

Harrison, D. (1986). Robustness of IRT parameter estimation to violations of the 
unidimensionality assumption. Journal of Educational Statistics, 11, 91-115. 

Hattie, J. (1984). An empirical study of various indices for determining unidimensionality. 
Multivariate Behavioral Research, 19, 49-78. 

Hattie, J. (1985). Methodology review: Assessing unidimensionality of tests and items. 
Applied Psychological Measurement, 2, 139-164. 

Holland, P. W., & Rosenbaum, P. R. (1986). Conditional association and 
unidimensionality in monotone latent variable models. Annals of Statistics, 14, 1523-1543. 

Hulin, C. L., Drasgow, F., & Parsons, L. K. (1983). Item response theory (p. 235). Dow 
Jones-Irwin: Homewood, Illinois. 

Humphreys, L. G. (1981). The primary mental ability. In M. P. Friedman, J. P. Das, & 
N. O'Connor (Eds.), Intelligence and learning (pp. 87-102). New York: Plenum Press. 

Humphreys, L. G. (1985). General intelligence: An integration of factor, test, and 
simplex theory. In B. B. Wolman (Ed.), Handbook of intelligence. John Wiley, New York. 

Humphreys, L. G. (1986). An analysis and evaluation of test and item bias in the 
prediction context. Journal of Applied Psychology, 71, 327-333. 

Jamshid, E., & McDonald, R. P. (1983). A second generation nonlinear factor analysis. 
Psychometrika, 48, 315-342. 

Junker, B., & Stout, W. (1991). Structural robustness of ability estimates in item response 
theory. Paper presented at the 7th European Meeting of the Psychometric Society, Trier, 
Germany. 

Linn, R. L., Hastings, N. C., Hu, G., & Ryan, K. E. (1987). Armed Services Vocational 
Aptitude Battery: Differential item functioning on the high school form. Dayton, OH: 
USAF Human Resources Laboratory. 

Lord, F. M., & Novick, M. R. (1968). Statistical theories of mental test scores. Reading, 
Massachusetts: Addison-Wesley. 

Lumsden, J. (1961). The construction of unidimensional tests. Psychological Bulletin, 
55, 122-131. 

McDonald, R. P. (1962). A general approach to nonlinear factor analysis. Psychometrika, 
4, 397-415. 

McDonald, R. P., & Ahlawat, K. S. (1974). Difficulty factors in binary data. British 
Journal of Mathematical and Statistical Psychology, 27, 82-89. 

Nandakumar, R. (1987). Refinements of Stout's procedure for assessing latent trait 
dimensionality. Unpublished doctoral dissertation, University of Illinois, 
Urbana-Champaign. 

Nandakumar, R. (1991). Traditional dimensionality vs. essential dimensionality. 
Journal of Educational Measurement, 28, 1-19. 

Nandakumar, R., & Stout, W. F. (in press). Refinements of Stout's procedure for assessing 
latent trait dimensionality. Journal of Educational Statistics. 

National Assessment of Educational Progress (1988). User guide: 1985-86 public-use data 
tapes. National Assessment of Educational Progress, Princeton, NJ. 

Reckase, M. D. (1979). Unifactor latent trait models applied to multifactor tests: 
Results and implications. Journal of Educational Statistics, 4, 207-230. 

Reckase, M. D. (1985). The difficulty of test items that measure more than one 
ability. Applied Psychological Measurement, 9, 401-412. 

Reckase, M. (1989). The interpretation and application of multidimensional item 
response theory models; and computerized testing in the instructional environment. Office 
of Naval Research Technical Report (N00014-85-C-0241). 

Reckase, M. D. (1990). Unidimensional data from multidimensional tests and multidimensional 
data from unidimensional tests. Paper presented at the annual AERA meeting, Boston. 

Rogers, A. M., Kline, D. L., Norris, N. A., Johnson, E. G., Mislevy, R. J., Zwick, R., Barone, J. 
L., & Kaplan, B. A. (1988). National Assessment of Educational Progress, 1985-86 
public-use data tapes, version 2.0: User guide. Educational Testing Service, 
Princeton, NJ. 

Rosenbaum, P. R. (1988). Item bundles. Psychometrika, 53, 349-359. 

Roznowski, M. A., Tucker, L. R., & Humphreys, L. G. (1991). Three approaches to 
determining the dimensionality of binary data. Applied Psychological Measurement, 15, 
109-128. 

Stout, W. F. (1987). A nonparametric approach for assessing latent trait dimensionality. 
Psychometrika, 52, 589-617. 

Stout, W. F. (1990). A new item response theory modeling approach with applications 
to unidimensionality assessment and ability estimation. Psychometrika, 293-326. 

Thissen, D., Steinberg, L., & Mooney, J. (1989). Trace lines for testlets: A use of 
multiple-categorical-response models. Journal of Educational Measurement, 26, 247-260. 

Traub, R. E. (1983). A priori considerations in choosing an item response model. In R. 
K. Hambleton (Ed.), Applications of item response theory. British Columbia: Educational 
Research Institute of British Columbia. 

Wilson, D., Wood, R. L., & Gibbons, R. (1983). TESTFACT: Test scoring and item 
factor analysis [Computer program]. Chicago: Scientific Software. 

Zwick, R. (1987). Assessing the dimensionality of NAEP reading data. Journal of 
Educational Measurement, 24, 293-308. 



Acknowledgements 



The author thanks Bill Stout and Brian Junker for their helpful comments and many suggestions 
on this research, and Mark Reckase and Tim Miller for providing ACT data. 



Table 1 

Results of H0: d_E = 1, α = .05 

Test          No. of       No. of       T            p            Selection of  M            Items of AT1 
              items        examinees                              AT1 items 

[illegible]   [illegible]  2428         [illegible]  .00001       FA            6            6,12,13,14,15,16 
[illegible]   [illegible]  [illegible]  [illegible]  [illegible]  FA            5            7,23,24,26,30 
LIT           [illegible]  [illegible]  .71          .234         FA            [illegible]  5,9,18,20,22,26 
AR10          [illegible]  [illegible]  [illegible]  .727         FA            6            1,3,4,5,6,8 
AR12          [illegible]  [illegible]  .64          .260         FA            4            1,4,6,14 
GS10          [illegible]  1990         .96          .168         FA            5            4,16,19,23,25 
GS12          [illegible]  [illegible]  -.26         .601         FA            6            14,15,19,23,24,25 
AS10          25           1981         2.27         .012         FA            5            4,16,19,23,25 
AS12          25           1974         3.64         .000         FA            5            3,4,8,14,22 
MATH          40           2491         2.79         .003         FA            10           1,5,25,27,29,30,32,34,35,39 
READ-A        40           5000         8.67         .00001       FA            10           31,32,33,34,35,36,37,38,39,40 
READ          30           5000         .48          .32          FA            7            1,2,6,11,12,13,21 
SCI-A         40           5000         3.19         .0007        FA            12           29,30,31,32,33,34,35,36,37,38,39,40 
SCI           28           5000         2.97         .002         FA            5            2,3,5,8,12 

AT1 items can be selected by using factor analysis (FA) or by expert 
opinion (EO). 

M is the size of AT1. 



Table 2 

Results of H0: d_E = 1, α = .05 

Test      No. of   No. of      T       p       Selection of   M    Items of AT1 
          items    examinees                   AT1 items 

READ1     30       750         .05     .480    FA             5    11,12,13,15,17 
READ2     30       1000        .48     .317    FA             7    1,2,6,11,12,13,21 
READ3     30       1250        -.06    .524    FA             7    2,4,6,9,11,12,13 
READ4     30       2000        1.01    .155    FA             5    1,11,12,13,16 
SCI1      25       750         1.89    .029    FA             7    1,3,4,5,17,20,21 
SCI2      28       1000        3.19    .007    FA             6    8,12,14,18,20,24 
SCI3      28       1250        1.38    .080    FA             7    6,9,10,11,19,25,28 
SCI4      28       2000        2.91    .001    FA             7    8,9,10,11,12,19,22 



Table 3 

Results of H0: d_E = 1 for two-dimensional tests: 
READ & SCI; α = .05 

Test    No. of items    No. of      T       p        Selection of   M    Items of AT1 
        READ    SCI     examinees                    AT1 items 

RS1     30      6       750         1.92    .020     EO             6    31,32,33,34,35,36 
RS2     30      6       1000        2.72    .003     EO             6    31,32,33,34,35,36 
RS3     30      6       1250        3.71    .0001    EO             6    31,32,33,34,35,36 
RS4     30      6       2000        3.32    .0005    EO             6    31,32,33,34,35,36 
RS      30      6       5000        6.83    .0000    EO             6    31,32,33,34,35,36 



Table 4 

Results of H0: d_E = 1 for two-dimensional tests: 
AR & GS; α = .05 

Test      No. of items    No. of      T       p       Selection of   M     Items of AT1 
          AR      GS      examinees                   AT1 items 

ARGS1     30      5       1853        2.85    .002    EO             5     31,32,33,34,35 
ARGS2     30      10      1853        6.15    .000    EO             10    31,32,33,34,35,36,37,38,39,40 
GSAR1     25      5       1811        4.29    .000    EO             5     26,27,28,29,30 
GSAR2     25      10      1811        4.06    .000    EO             10    26,27,28,29,30,31,32,33,34,35 



Table 5 

Results of H0: d_E = 1 for two-dimensional tests: 
HIST & LIT; α = .05 

Test        No. of items    No. of      T       p       Selection of   M     Items of AT1 
            HIST    LIT     examinees                   AT1 items 

HSTLIT1     31      5       2428        3.01    .036    EO             5     32,33,34,35,36 
HSTLIT2     31      8       2428        3.38    .000    EO             8     32,33,34,35,36,37,38,39 
HSTLIT3     31      10      2428        2.03    .021    EO             10    32,33,34,35,36,37,38,39,40,41 







